Knowledge and ignorance in incomplete quantum state tomography 
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Quantum state reconstruction on a finite number of copies of a quantum system with information- 
ally incomplete measurements does, as a rule, not yield a unique result. We derive a reconstruction 
scheme where both the likelihood and the von Neumann entropy functionals are maximized in order 
to systematically select the most-likely estimator with the largest entropy, that is the least-bias 
estimator, consistent with a given set of measurement data. This is equivalent to the joint consid- 
eration of our partial knowledge and ignorance about the ensemble to reconstruct its identity. An 
interesting structure of such estimators will also be explored. 
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The acquisition of information about a given ensemble 
composed of many copies of a quantum system, which is 
used to identify its quantum state for instance, always de- 
pends on the way the measurements are performed. Such 
a protocol of probing the ensemble is known as quantum 
tomography. Since our measurement resources are always 
Umited, the information is never complete and the infer- 
ence of the nature of this ensemble from the measurement 
data needs to account for the fact that some aspects of 
the ensemble are fully characterized while others are not. 

This is especially the case if the ensemble to be charac- 
terized is complex and no feasible set of informationally 
complete measurements is available. In this case, some 
aspects of the quantum ensemble are not measured and 
reconstructions of its properties are not unique. For ex- 
ample, the true quantum state of a mode of light made 
up of an ensemble of photons can be described by a sta- 
tistical operator ptruc residing in an infinite-dimensional 
Hilbert space, and no matter how ingeniously a measure- 
ment scheme is designed to probe this ensemble, an infi- 
nite amount of information about it will always remain 
unknown. 

The standard approach to this problem is to apply an 
ad hoc truncation on the Hilbert space and perform the 
state reconstruction in a particular subspace. This re- 
sults in a smaller number of unknown parameters that 
can then be uniquely determined by the measurement 
scheme. Since the truncation is largely based on the ex- 
perimentalist's intuition about the expected result, that 
is the true state that describes an ensemble of infinitely 
many copies of such quantum systems, this cannot be 
an objective method A more objective alternative is 
to consider the largest possible reconstruction subspace 
which is compatible with any existing prior knowledge 
about the ensemble. For example, if an experimental- 
ist has prior knowledge of the range of the energy spec- 
trum of a particular quantum ensemble, he should con- 
sider the largest possible reconstruction subspace that 



contains quantum states describing the ensemble in this 
range of energies. This inevitably introduces more un- 
known parameters which cannot be unqiuely determined 
by the measurements and we should select the state es- 
timator in this subspace that is least biased. In this Let- 
ter, we show how to carry out this procedure usingtwo 
celebrated principles — maximum likelihood (ML) 
and maximum entropy (ME) They will be utilized 
concurrently to yield a unique and objective state re- 
construction scheme, the maximum likelihood-maximum 
entropy reconstruction scheme (MLME), a synthesis of 
knowledge and ignorance. This scheme will be appli- 
cable for any general set of quantum measurements, in 
particular for those which are non-commuting. 

By maximizing the likelihood, information is extracted 
from the measured data (knowledge) in an optimal 
way 0] and an estimator for the true state describing a 
given ensemble of quantum systems can be reconstructed. 
However, due to the informational incompleteness of the 
set of measurements, there exists in general a convex set 
of estimators which are all maximally likely. We shall 
select from this set the estimator with the largest en- 
tropy, which represents the least-bias guess of the true 
state consistent with the measurement data. As both the 
entropy and likelihood functionals are convex, our pro- 
posed reconstruction scheme will guarantee a unique so- 
lution even for informationally incomplete measurement 
schemes. This reconstruction procedure minimizes the 
spurious details coming from the parameters which are 
not uniquely determined by the measurements. 

Given a source that produces identical copies of quan- 
tum systems, each in the state described by the statis- 
tical operator ptrue, one can perform measurements on 
N such copies. These measurement outcomes are de- 
scribed by a set of positive operators Hj that compose 
a POM (probability operator measurement), with j run- 
ning over all outcomes, and a list of outcome occurrences 
Uj, ~ correspondingly the measured fre- 
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qucncies fj 



N ' 



is the raw data obtained from these 



measurements. Throughout this analysis, we shall as- 
sume that the measurements are perfect in the sense of 
■ n ," = 1. The next step is to infer the unknown state 
Ptruo from the measurement data. For this, we look for 
estimators — the ML estimators — which maximize the 
likelihood functional defined as 
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with pj — trlpllj}. Using this method, one can always 
obtaiiipositive estimators suitable for statistical predic- 
tions [5[. 

A POM is informationally complete if the mapping of 
statistical operators p on the probabilities pj is injective. 
This results in a unique operator pml ^ that maxi- 
mizes where a hat is used to denote an estimator. 
We shall focus on the case in which the POM is not infor- 
mationally complete. This means that there is a convex 
set of estimators that maximize C{p) for a given set of 
fjS. The next task is now to choose the one (the MLME 
estimator) which maximizes the von Neumann entropy 
S{p) = — trjplogp} over the set of ML estimators. It 
is well-known that the ME estimator is of the form 



Pme 
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with real pjS. Note that our proposed strategy is fun- 
damentally different from the standard ME scheme. The 
latter involves searching for the ME estimator puE which 
produces the measured frequencies of POM outcomes or 
moments of a particular observable and maximizing its 
entropy subjected to these linear constraints 0, Q . Since 
these constraints can become incompatible with one an- 
other due to the presence of statistical noise, this proce- 
dure may fail because there simply is no ME estimator. 

In our current strategy, instead of taking the data as 
strict linear constraints, information is extracted from 
the data via the nonlinear ML technique and mapped 
onto the convex subset of quantum states constituting 
the plateau of the likelihood functional. Thereafter, the 
entropy is maximized to yield a unique unbiased esti- 
mator. Conceptually, this can be turned into a convex 
optimization problem of maximizing the objective func- 
tional 



Ii\;p) = XS (p) + - log C{p), A>0, 



(3) 



which involves a single parameter A as a weight on 
S{p). This I(A; p) is the amalgam of two separate mea- 
sures of information and so reflects the joint consider- 
ation of knowledge and ignorance. In fact, we note 
that up to an irrelevant additive constant, I(A; p) is a 
weighted sum of two entropies: S{p) which quantifies the 



"lack of information" and the negative of S{{fj}\{pj}) = 
J2j fj^'^sifj/Pj) (relative entropy) which quantifies the 
"gain of information" from the measurements. Hence, 
Eq. ([3]) is indeed a natural combination of two comple- 
mentary aspects of information. 

Since bC{p)/bp ~ due to the constraint of maximal 
likelihood, varying I(A; p) with respect to p for fixed A 
gives 
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In order to maximize I(A;p), we need the derivative to 
be zero and this is obtained only when A — > 0. In other 
words, in order to search for the MLME estimator via 
Eq. ([3]), it is necessary to take both of our knowledge 
and ignorance of the unknown true state into considera- 
tion in such a way that our ignorance takes an infinites- 
imal weight. Geometrically, the log- likelihood functional 
is much larger than the entropy functional in this limit 
and since the log-likelihood functional has a plateau cor- 
responding to the convex set of most-likely estimators, 
a tiny admixture of the entropy functional introduces a 
gentle convex hill top within the plateau to select the 
maximum entropy estimator. 

By parameterizing a given statistical operator p as 
p = A/ti{A^ A} to ensure positivity, one can invoke 
the method of steepest ascent and derive an iterative 
routine for MLME. Here, we simply mention that one 
can start from the maximally-mixed state and iterate the 
equations 
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(1 + eTk) pk (1 + eTfe) 



tT{{l + eTk) pk {I + eTk)} ' 
Ifc = - 1 - A {log Pk - tr{pk logpfc}) 



(5) 
(6) 



with a step size e and Rk — J2j i^hlP^P^ ^j, until the 
extremal equations Tk'Pk' — Pk'Tk' = are satisfied for 
a particular k ~ k' with some numerical precision. We 
denote this operator as the MLME estimator pmlme = 
Pk'- If A = 0; we recover the ML iterative scheme [sj. 

Wc compare the MLME scheme with the standard 
ME scheme using the simple example of a trine POM 
defined by the three outcomes Hq = (1 -I- crz)/3 and 
n± = (1 ± \/3ax/2 — (72/2)/3, where cr^ and are 
standard Pauli operators. A straightforward calculation 
shows that when no — 6, n+ ~ 2 and n_ = 1 af- 
ter measuring = 9 copies for instance, the standard 
ME scheme fails as no quantum state has the frequencies 
/o = 2/3, /+ = 2/9 and /_ = 1/9 as probabilities. On 
the other hand, the MLME scheme still gives a positive 
estimator described by the Bloch vector (0.194,0,0.981) 
for those frequencies, thus showing its versatility. Only 
when the frequencies are probabilities giving positive es- 
timators may we use the ME scheme and in this case, the 
MLME scheme naturally incorporates these constraints. 
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To discuss the method of choosing A, we shah apply the 
MLME scheme to homodyne tomography, a technique 
which is used to reconstruct quantum states of hght @ ■ 
This is typically done by measuring a POM which resem- 
bles a set of eigenstate projectors \x^) of quadrature 
operators Xcosi? -I- Psini? for various d values, where 
X and P are respectively the position and momentum 
quadrature operators and x and ■)? are parameters speci- 
fying these projectors. It is clear that a finite set of such 
measurements is never informationally complete in the 
infinite-dimensional Hilbert space and thus the MLME 
scheme is necessary to obtain a unique estimator. Fig. [1] 
shows the dependence of log {C{p)) /N and S{p) on A 
such that 6I(A — >■ 0; p) = 0. In practice, A can be chosen 
from a range near zero, within which log(£(p)) /N and 
S{p) remain almost constant. 
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FIG. 1: A simulation on quantum tomography on a randomly 
generated mixed state of light in the five-dimensional Fock 
space. In this plot, the number of copies of quantum systems 
measured is fixed at N = 10*. A choice of 20 quadrature 
eigenstates made up of four different ■& settings, with five x 
values corresponding to each setting, which are projected onto 
this space was used and state estimators are constructed for 
difi'erent values of A. As A decreases, both the entropy and 
likelihood functionals approach their respective optimal val- 
ues obtained from MLME (i.e. when A ^ 0). When A is 
zero, there is a convex set of estimators giving the optimal 
likelihood value. For very large A values, the estimators ap- 
proach the mciximally-mixed state and hence S{p) approaches 
the maximal value log 5. 

Homodyne tomography is commonly used not only in 
quantum tomography on the true state, but also in quan- 
tum diagnostics where a given true state is to be classified 
as being classical/non-classical or separable/entangled. 
A typical quantity which is often investigated as an indi- 
cation of whether an unknown true state is non-classical 
is the value of the Wigner functional at the phase space 
origin evaluated with a reconstructed estimator p for the 
unknown true state. This is defined as W^o = StrjpT'}, 
with V = J dx jxij) (— x^l for any 1). In particular, a neg- 
ative value for Wqo implies that p is a non-classical state. 
To obtain an estimator p, one would need to choose a 



subspace from the infinite-dimensional Hilbert space in 
which the reconstruction procedure is tractable. This 
means that the value of Woo will depend on this trun- 
cation, which in turn relies on the prior knowledge one 
has about the true state. Using our scheme, we perform a 
simulation, shown in Fig. [51 to illustrate this dependence. 
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FIG. 2: A simulation on quantum tomography on a randomly 
generated mixed state ptrue of light in the 20-dimensional 
Fock space with a sfightly positive Woo — 0.141 . Du- and 
Woo respectively denote the trace-class distance between the 
reconstructed estimator and the true state and the Wigner 
function at the phase space origin, both averaged over 50 
experiments with A'^ — 10*. The same set of 20 quadrature 
eigenstates as in Fig.[TJ projected onto this space was used and 
this set of measurements is informationally complete in the 
two-, three-, and four-dimensional Fock subspaces (shaded 
region). The values Woo and Dtv were obtained by ML 
in subspaces of dimensions two to four, and by the MLME 
scheme in dimensions greater than four. The plot shows a 
strong dependence of Woo and Dtr on the subspace dimen- 
sion. In this case, it is obvious that the negativity of Woo 
inferred by a reconstruction in a subspace too small is just an 
artifact of the truncations. Also, Dti decreases as the recon- 
struction subspace increases in dimension. This demonstrates 
the advantages of the MLME scheme over the ML method. 

If the true state lies outside the subspace of interest, 
then the estimated value of Wqo can drastically deviate 
from the true value. It is clear that a truncation of the 
Hilbert space into a smaller reconstruction subspace can 
lead to diagnostics which are highly incompatible with 
the true result. So, if one is interested in performing an 
objective quantum tomography experiment on a given 
ensemble of quantum systems with some prior knowledge 
regarding its true state, an option would be to recon- 
struct the MLME estimator in the largest possible sub- 
space based on this prior knowledge. By enlarging the re- 
construction subspace, many more admissible states are 
taken into consideration and more reliable state estima- 
tions and quantum diagnostics can thus be performed. 
We now have an operational reconstruction scheme that 
combines our knowledge and ignorance about the un- 
known true state to give us a unique state estimator in 
an objective way. 

There exists an interesting structure in these MLME 
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estimators and to explore it, one needs some knowledge 
on the structure of the POM used and its influence on 
the D-dimensional Hilbert space. Suppose the set of 
K POM elements Hj are informationally incomplete. A 
consequence of this is that the number of linearly inde- 
pendent IljS is less than D^. To determine their linear in- 
dependence, we can look for the eigenvalues of the K x K 
Gram matrix M with the matrix elements defined as 

M,,=tr{n,nfe}. (7) 

The number of positive eigenvalues n>o < of M deter- 
mines the number of linearly independent measurement 
outcomes. The largest value of n>o is since this is 
the maximum number of linearly independent operators 
spanning the space of Hcrmitian operators as a basis. 
Hence a set of informationally incomplete IljS acting on 
the D-dimensional Hilbert space is such that n>o < D^. 

Any D-dimensional positive operator can be repre- 
sented by a set of Hcrmitian basis operators Tj = P^ 
satisfying the trace-orthonormality condition trjPjPfe} = 
Sjk- For dimension two, an example of such a basis is the 
the familiar set of four operators l/\/2, ax/V^, ay/V^ 
and Ozl^pl- Once the number of independent measure- 
ment outcomes rt>o is known, one can construct a set 
{Pj}"^° of n>o trace-orthonormal Hcrmitian basis oper- 
ators directly from the K POM elements. In other words, 
each of the K POM elements can be expressed as a linear 
combination of the n>o basis operators 

H,=^a,,Pfe, (8) 
fc=i 

where all coefficients Oj-fe are real. This implies that the 
n>o-dimensional subspace is spanned by the basis oper- 
ators that uniquely specify the POM outcomes. We will 
coin this subspace the measurement subspace. The rest 
of the — n>o Hcrmitian basis operators, which are 
trace-orthonormal to the previous set and span the sub- 
space, that is complement to the measurement subspace 
can also be constructed. 

Suppose a state estimator pml is generated using the 
ML procedure on a set of measurement data obtained 
from the POM outcomes Hj. We can represent this es- 
timator by a set of Hcrmitian trace-orthonormal basis 
operators inasmuch as 

ri>o 

PML = E^rr,+ ^rr... (9) 

k—1 fc— n>o + l 

= PML =Pmb 

The part pml resides in the measurement subspace, 
which is spanned by the measurement outcomes Hj giv- 
ing the measurement data, and is uniquely fixed for all 
ML estimators by the ML procedure for the same set of 



measurement data. The part pme resides in the comple- 
mentary subspace, which is orthogonal to the measure- 
ment subspace, and thus does not contribute to the PjS. 
In other words, trlpMElfj} = and this can imply the 
existence of a family of pmes that gives the same set of 
ML probabilities as long as the /5mls are positive. 

Therefore, the MLME scheme can be understood as an 
optimization over the complementary subspace to max- 
imize S{p) under the constraint pmlme > 0. However, 
one notes that only certain sets of c^^s are allowed dur- 
ing the optimization due to this positivity constraint. 
This is especially important when pmlme is rank defi- 
cient and lies on the boundary of the state space. Geo- 
metrically, the plateau of most-likely states is generally a 
much smaller subspace contained in the complementary 
subspace. In some cases, this plateau contains a sin- 
gle ML estimator due to the positivity constraint even 
when the measurements are informationally incomplete. 
In general, the boundary of the plateau is complicated 
and deserves further study. 

In summary, we have developed a state reconstruction 
scheme which is applicable to any set of measurements, 
particularly those which are informationally incomplete, 
as in homodyne tomography for instance. We emphasize 
that in order to carry out least-bias state reconstructions 
and quantum diagnostics on an ensemble of quantum sys- 
tems, a good way is to do this over a large subspace of 
states compatible with some prior knowledge on the en- 
semble in order to avoid inaccurate results which can have 
detrimental effects on statistical predictions. We then ex- 
plored the geometrical structure of the state estimators 
obtained from this scheme and gave an alternative un- 
derstanding to the state reconstruction procedure. 
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